A Probabilistic Parser and Its Application
نویسندگان
چکیده
We describe a general approach to the probabilistic parsing of context-free grammars. The method integrates context-sensitive statistical knowledge of various types (e.g., syntactic and semantic) and can be trained incrementally from a bracketed corpus. We introduce a variant of the GHR contextfree recognition algorithm, and explain how to adapt it for e cient probabilistic parsing. In splitcorpus testing on a real-world corpus of sentences from software testing documents, with 20 possible parses for a sentence of average length, the system nds and identi es the correct parse in 96% of the sentences for which it nds any parse, while producing only 1.03 parses per sentence for those sentences. Signi cantly, this success rate would be only 79% without the semantic statistics.
منابع مشابه
Studying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملExtension of Cube Attack with Probabilistic Equations and its Application on Cryptanalysis of KATAN Cipher
Cube Attack is a successful case of Algebraic Attack. Cube Attack consists of two phases, linear equation extraction and solving the extracted equation system. Due to the high complexity of equation extraction phase in finding linear equations, we can extract nonlinear ones that could be approximated to linear equations with high probability. The probabilistic equations could be considered as l...
متن کاملA modification of probabilistic hesitant fuzzy sets and its application to multiple criteria decision making
Probabilistic hesitant fuzzy set (PHFS) is a fruitful concept that adds to hesitant fuzzy set (HFS) the term of probability which is able to retain more information than the usual HFS. Here, we demonstrate that the existing definitions of PHFS are not still reasonable, and therefore, we first improve the PHFS definition. By endowing the set and algebraic operations with a new re-definition of P...
متن کاملFeature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کاملRobust Probabilistic Predictive Syntactic Processing
of “Robust Probabilistic Predictive Syntactic Processing” by Brian Edward Roark, Ph.D., Brown University, May, 2001. This thesis presents a broad-coverage probabilistic top-down parser, and its application to the problem of language modeling for speech recognition. The parser builds fully connected derivations incrementally, in a single pass from left-to-right across the string. We argue that t...
متن کاملA Fire Ignition Model and Its Application for Estimating Loss due to Damage of the Urban Gas Network in an Earthquake
Damage of the urban gas network due to an earthquake can cause much loss including fire-induced loss to infrastructure and loss due to interruption of gas service and repairing or replacing of network elements. In this paper, a new fire ignition model is proposed and applied to a conventional semi-probabilistic model for estimating various losses due to damage of an urban gas network in an eart...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992